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Abstract 

This paper proposes a general duality framework for the problem of 
minimizing a convex integral functional over a space of stochastic pro- 
cesses adapted to a given filtration. The framework unifies many well- 
known duality frameworks from operations research and mathematical 
finance. The unification allows the extension of some useful techniques 
from these two fields to a much wider class of problems. In particular, 
combining certain finite-dimensional techniques from convex analysis with 
measure theoretic techniques from mathematical finance, we are able to 
close the duality gap in some situations where traditional topological ar- 
guments fail. 

1 Introduction 

Let (H, J 7 , P) be a probability space with a filtration {Tt)J =0 (an increasing 
sequence of sub-sigma-algebras of J 7 ) and consider the problem 

minimize Ef(x(uj),u(uj)^uj) over x G TV (1) 

where / is an extended real-valued function, Af is a space of (J- t)T=o~ adapted 
decision strategies and u is a measurable function (exact definitions will be given 
below). The variable u represents parameters or perturbations of a dynamic 
decision making problem where the objective is to minimize the expectation 
over decision strategies adapted to the information available to the decision 
maker over time. This paper derives dual expressions for the optimal value 
of |T|) by incorporating some measure theoretic techniques from mathematical 
finance into the general conjugate duality framework of Rockafellar 36 . 

Problem ([T]) covers many important optimization models in operations re- 
search and mathematical finance. Specific instances of stochastic optimization 
problems can often be put in the above format by appropriately specifying the 
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integrand /. Allowing the integrand / to take on the value +00, we can repre- 
sent various pointwise (almost sure) constraints by infinite penalties. Some of 
the earliest examples can be found in Danzig jTD] and Beale [J]. Problem (fTJ) 
provides a very general framework also for various optimization and pricing 
problems in mathematical finance. Certain classes of stochastic control prob- 
lems can also put the above form; see [101 Section 6]. In some applications, the 
parameter u is introduced into a given problem in order to derive information 
(such as optimality conditions or bounds on the optimal value) about it. This is 
the point of view taken e.g. in |36j . In other applications, the parameter u has a 
natural interpretation in the original formulation itself. Examples include finan- 
cial applications where u may represent the payouts of a financial instrument 
such as an option and one is trying to minimize the initial cost of a hedging 
portfolio. 

Convex duality has widespread applications in operations research, calculus 
of variations and mechanics. Besides in deriving optimality conditions, duality 
is used in numerical optimization and bounding techniques. The essence of 
convex duality is beautifully summarized by the conjugate duality framework 
of [36] which subsumes more special duality frameworks such as Lagrangian 
(and in particular LP) and Fenchel duality; see also Ekeland and Temam [IB] . 
Several duality results, including optimality conditions for certain instances of 
(fTJ) have been derived from the conjugate duality framework in Rockafellar and 

Wets IsHEHiDllH]. 

Convex duality has long been an integral part also of mathematical finance 
but there, duality results are often derived ad hoc instead of embedding a given 
problem in a general optimization framework. Attempts to derive financial du- 
ality results from known optimization frameworks are often hindered by two 
features. First, general duality frameworks are often formulated in locally con- 
vex topological vector spaces while in financial problems the decision strategies 
are usually chosen from a space that lacks an appropriate locally convex topol- 
ogy. Second, general duality results are often geared towards attainment of the 
dual optimum which requires conditions that often fail to hold in financial ap- 
plications. The main contribution of this paper is to propose a general enough 
duality framework for (fTJ) that covers several problems both in operations re- 
search as well as in mathematical finance. Our framework, to be rigorously 
specified in Section [21 is an extension of the stochastic programming duality 
frameworks proposed in [38l [40] . In our framework the parameters u enter the 
model in a more general manner and we do not restrict the decision strategies 
x to be bounded or integrable a priori. 

Allowing strategies to be general adapted processes has turned out be useful 
in deriving various duality results for financial models; see e.g. Schachermayer 
and Delbaen [2], Kabanov and Safarian [22] and their references. This paper 
extends such techniques to a much more general class of models. We obtain 
dual representations for the optimal value of ([T]) but not necessarily the dual 
attainment as opposed to the strong duality results in [38l [39l [40l [41] . Conse- 
quently, we cannot claim the necessity of various optimality conditions involving 
dual variables. Nevertheless, the mere absence of duality gap is useful in many 
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situations e.g. in mathematical finance where the "constraint qualifications" re- 
quired for classical duality results often fail to hold. For example, various dual 
representations of hedging costs correspond to the absence of the duality gap 
while the dual optimum might not be attained. As an application, we extend 
certain results on superhedging and optimal consumption to a general market 
model with nonlinear illiquidity effects and convex portfolio constraints. This 
will be done by extending the elegant (currency) market model of Kabanov [2 3) 
where all assets are treated symmetrically. More traditional market models are 
then covered as special cases. The absence of duality gap is useful also in de- 
riving certain simulation-based numerical techniques for bounding the optimum 
value of (HJ) as e.g. those proposed in Rogers [33] and Haugh and Kogan [T!5] in 
the case of optimal stopping problems. We extend such techniques for a more 
general class of problems. 

The rest of this paper is organized as follows. Section [5] presents the gen- 
eral duality framework for problem ([1} based on the conjugate duality frame- 
work of [36] . Sections [3] and 0] give some well-known examples and exten- 
sions of duality frameworks from operations research and mathematical finance, 
respectively. Section [5] extends some classical closedness criteria from finite- 
dimensional spaces to the present infinite-dimensional stochastic setting. 

2 Conjugate duality 

We study ([1]) in the conjugate duality framework of Rockafellar |36) . However, 
we deviate from [36] in that the space AT of decision variables need not be a 
locally convex topological vector space paired with another one. This precludes 
the completely symmetric duality in |36] but in some situations it yields more 
regularity for the optimal value than what can be obtained e.g. with integrable 
strategies. 

For given integers nt, we set 

M={{x t )^ \x t ei°(0,Ji,P;R"*)}, 

where L°(£l, .Ft, P; R n *) denotes the space of equivalence classes of ^-measurable 
R™ 4 -valued functions that coincide P-almost surely. Each xt is interpreted as 
a decision that is made after observing all available information at time t. In 
applications, the filtration (F"t)|l is often generated by a finite-dimensional 
stochastic process whose values are observed at discrete points in time. If Fo is 
the trivial sigma algebra {0, £1} then the first component xq is deterministic, 

The function / is assumed to be an extended real-valued convex normal in- 
tegrand on R™ x R m x where n = n + . . . + nx and m is a given integer. This 
means that the set-valued mapping u i— > epi/(-, -,u)) is ^-measurable and it has 
closed and convex values (so (x, u) i— > f(x, u, w) is convex and lower semicontinu- 
ous for every u); see e.g. @U Chapter 14] . This implies that / is B(R n x R m )<g) F- 
measurable and that the function (x, u) i— > f(x, u, u) is lower semicontinuous and 
convex for every u>. It follows that uj \- > f(x(uj),u(Lu),Lu) is ^-measurable for 
every x e L°(Q, F, P; R") and u G L°(Cl, P; R m ). Throughout this paper, 
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the expectation of an extended real-valued measurable function is denned as 
+00 unless the positive part is integrable. The integral functional 

If(x,u) := Ef(x(uj),u(uj),uj) 

in the objective of ([1]) is then well-defined extended real- valued convex function 
on L°(Q, I, P; R n ) x L°(f2, J 7 , P; M m ). Normal integrands possess many useful 
properties and they arise quite naturally in many optimization problems in 
practice. Examples will be given in the following sections. We refer the reader 
to [37] or |HJ Chapter 14] for general treatment of normal integrands on R d x f2 
for finite d. 

For each u e L°(fi, J 7 , P; K m ), the optimal value of (JTJ is given by the waZue 
function 

tp{u) := inf If(x,u). 

x£j\f 

By |36l Theorem 1], y> is convex. We will derive dual expressions for tp on the 
space LP := L P (Q, F, P; K m ) using the conjugate duality framework of Rock- 
afellar [36] . To this end, we pair LP with L q , where q G [l,oo] is such that 
l/p + 1/q = 1. The bilinear form 

= • y(w)] 

puts L p and L q in separating duality. The weakest and the strongest locally con- 
vex topologies on LP compatible with the pairing will be denoted by o-(L p ,L q ) 
and T(L p ,L q ), respectively (similarly for L q ). By the classical separation argu- 
ment, a convex function is lower semicontinuous with respect to a(L p ,L q ) if it 
is merely lower semicontinuous with respect to r(L p ,L q ). 

Remark 1. For p S [l,oo), T(L p ,L q ) is the norm topology and o~(L p ,L q ) is 
the weak-topology that LP has as a Banach space with the usual LP -norm. For 
p = oo, o~(L p ,L q ) is the weak* -topology that LP has as the Banach dual of 
L q while t(L p , L q ) is, in general, weaker than the norm topology. Lt follows 
from the Mackey-Arens and Dunford- Pettis theorems, that a sequence in L°° 
converges with respect to t(L°°, L 1 ) if and only if it norm-bounded and converges 
in measure; see Grothendieck U81 Part 4] for the case of locally compact measure 
spaces. Ln mathematical finance, a convex function on L°° is sometimes said to 
have the "Fatou property" if it is sequentially lower- semicontinuous with respect 
to TfX 00 ,^ 1 ). 

Remark 2. Instead of LP and L q , we could take an arbitrary pair of spaces 
of measurable R m -valued functions which are in separating duality under the 
bilinear form (u,y) — E[u(uS) ■ y(ui)]- Examples include Orlicz spaces which 
have recently been used in a financial context by Biagini and Frittelli J^j. 

The conjugate of a function ip on LP is the convex function on L q defined by 
(p*(y) = sup{(u,y) -<p(u)}. 

u£Lp 
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The conjugate of a function on L q is denned similarly. It is a fundamental result 
in convex duality that p** = clip where 



dp — 



\sc<p if (he <p)(u) > — oo Vu e L p , 
-co otherwise 



is the closure of tp; see e.g. [361 Theorem 5]. Here lsc tp denotes the lower 
semicontinuous hull of <p. If lsc Lp has a finite value at some point then lsc p> is 
proper and lscy = c\p] see [361 Theorem 4]. 

The Lagrangian associated with (fTJ) is the extended real-valued function on 
N x L q defined by 

L(x,y) = inf {If(x,u) - (u,y)}. 

The Lagrangian is convex in x and concave in y. The dual objective is the 
extended real-valued function on L q defined by 

g(y) = inf L(x,y). 

Since g is the pointwise infimum of concave functions, it is concave. The basic 
duality result [36] Theorem 7] says, in particular, that 

9 = ~<P ■ 

This follows directly from the above definitions and does not rely on topological 
properties of TV. The biconjugate theorem then gives the dual representation 

(c\ip)(u)=aup{(u,y)+g(y)}. (2) 

In many applications, the parameter u has practical significance, and the 
dual representation may yield valuable information about the function <p. 
On the other hand, in some situations, one is faced with a fixed optimization 
problem and the parameter u is introduced in order to derive information about 
the original problem. This is the perspective taken in [36] , where the minimiza- 
tion problem 

minimize If(x,0) over x 6 Af (3) 
would be called the primal problem and 

maximize g(y) over y 6 L q (4) 

the dual problem. By @, the optimum values of ([3]) and ^ are equal ex- 
actly when (c\p)(0) = (p(0). An important topic which is studied in [36] but 
not in the present paper is derivatives of the value function ip and the associ- 
ated optimality conditions. In this paper, we concentrate on the more general 
property of lower semicontinuity of ip; see Section [SJ The lower semicontinuity 
already yields many interesting results in operations research and mathematical 
finance. Moreover, lower semicontinuity is useful for proving the continuity of 
p for p < oo since a lower semicontinuous convex function on a barreled space 
is continuous throughout the interior of its domain; see e.g. [321 Corollary 8B] . 
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Remark 3. As long as the integral functional (x,u) H> If(x,u) is closed in u 
(which holds under quite general conditions given e.g. in Rockafellar JSSf), the 
biconjugate theorem gives 

If(x, u) = sup {L(x, y) + (u, y)} 

y£Ll 

and, in particular, lf(x,0) = sup y L(x, y) so that(p(0) = m£ X £js $up yeLq L(x,y). 
On the other hand, ([2]) gives (cl</?)(0) = sup^g^, mf xe _\f L(x, y) so that the con- 
dition (cly)(0) = 95(0) can be expressed as 

inf sup L(x,y) = sup inf L(x,y). 

In other words, the function L has a saddle-value iff tp is closed at the origin. 
Along with the general duality theory for convex minimization, the conjugate 
duality framework of \36^ addresses general convex- concave minimax problems. 

The following interchange rule will be useful in deriving more explicit ex- 
pressions for the dual objective g. It is a special case of [42] Theorem 14.60] 
and it uses the fact that for an ^-measurable normal integrand h, the function 
uj h-» ini u h(u,uj) is ^-measurable; see [UJ Theorem 14.37]. 

Theorem 1 (Interchange rule). Given an T -measurable normal integrand h on 
R k x O, we have 

inf Eh(u(uj),uj) — E inf h(u.oj) 

as long as the left side is less than +00. 

Theorem [T] yields a simple proof of Jensen's inequality. Throughout this 
paper, the conditional expectation of a random variable x with respect to J- t 
will be denoted by E t x; see e.g. Shiryaev [13 II. 7]. 

Corollary 2 (Jensen's inequality). Let h is an J r t -measurable convex normal 
integrand onR k xfl such that Eh*{v(uj), uj) < 00 for some v € L q {Vt, T t , P; K fe ). 
Then 

Eh((E t x)(uj),uj) < Eh(x(w),L)) 
for every x £ L p (fl, T, P; M fe ). 
Proof. Applying Theorem [1] twice, we get 

I h {E t x) = Esu V {v ■ (E t x)(uj) - h*(v,Lj))} 

v 

= sup E{v(uj) ■ (E t x)(u>) — h*(v(uj),Lu)} 
= sup E{v(ui) ■ x(uj) — h*(v(ui),uj)} 
< sup E{v{uj) ■ x(uj) — h*(v(uj), uj)} 
= Es\xp{v ■ x{uj) — h*(v, uj)} 

V 

= Eh(x(u), uj), 
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where the third equality comes from the law of iterated expectations; see e.g. [JTJ 
Section 11.71. □ 



Going back to Q]), we define 

l(x,y,ui) = inf {f(x,u,ui)-u-y}. 

This is an extended real- valued function on K™ x R m x ft, convex in x and 
concave in y. Various dual expressions in stochastic optimization and in math- 
ematical finance can be derived from the following result which expresses the 
dual objective in terms of I. In many situations, the expression can be written 
concretely in terms of problem data; see Sections [3] and H] Given an r G [1, oo], 
we let 

AT :=Mf\L r {Q.,P,T;W l ). 

Theorem 3. The junction u> H> 1(x(uj), y{u>), uj) is measurable for any x G TV 
and y G L q so the integral functional Ii(x,y) = El(x(ui),y(u>),u>) is well-defined 
on TV x L q . As long as If ^ +oo, we have 

g(y) = inf L(x,y) 
If, in addition, I is of the forn^ 

T 

l(x,y,w) = ^2h(x t ,y,uj) 
t=o 

for some B(R"') <8> B(M. m ) ® J 7 -measurable extended real-valued functions l t on 

R n t x R m x Q then 

g(y) = inf. Ii(x,y) 

xeN r 

as long as the right side is less than +oo. 

Proof. We have l(x(ui), y(ui), to) = —h*(y(uj),uj), where h(u, uj) := f{x{ui),u,ui). 
To prove the measurability it suffices to show that h* is a normal integrand on 
W n x f2. This follows from Proposition 14.45(c) and Theorem 14.50 of [4"2] . 

If If ^ +oo, then there exists an x G TV such that L(x, y) < oo for every 
y G L q . We can thus assume that L(x, y) < oo in the expression for g in which 
case 

L(x,y)=E inf {f(x(uj),u,ui) - u ■ y{ui)} = Ii{x,y), 

by Theorem [T] Here we apply the interchange rule to the function (u, uj) i— > 
f(x(uj),u,uj) which is a normal integrand, by Proposition 14.45(c)]. 

Fix a y G L q and let x G TV r be such that Ii(x,y) < oo. Let e > be 
arbitrary and let x' G TV be such that El(x'(u)),y(u)),ui) < g(y) + e. Defining 
x v t — x' t XA^ + XtXn\A^i where A" — {uj \x' t (uj)\ < v}, we have that the strategy 

1 Throughout this paper, we define oo — oo = +oo. 
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x" = (xt)f =0 is in N r and that x v t — > x' t almost surely for every t — 0, . . . , T 
as p/'oo. Since the functions uj \-> lt(x^(u>),y(uj),uj) are dominated by the 
integrable function 

u h-> max{l t (x' t ((j),y(u),tj)Jt(xt(u),y(u),u),0}, 

Fatou's lemma (applied in the product measure space Q x {0, . . . , T} obtained 
by equipping {0, . . . , T} with the counting measure) gives 

T T 

lim sup E ^2 k « ( w ) i y ( w ) > w ) < E ^2 lim sup lt ( x * ( w ) ' y ( w ) ' w ) 

T 

= f?53Zt(i't(w),l/(w),w) 

t=0 

< + £• 

Since e > was arbitrary and a; 1 ' G 7V r , the claim follows. □ 

The main content of the first part of Theorem [3] is that the infimum in the 
definition of the Lagrangian can be reduced to scenariowise minimization. This 
can sometimes be done even analytically. The last part of the above result shows 
that, while integrability of x may be restrictive in the original problem, it may 
be harmless in the expression for the dual objective g. A simple example will 
be given at the end of Example [T] below. In some applications, the integrability 
can be used to derive more convenient expressions for g. 



3 Examples from operations research 

This section reviews some well-known duality frameworks from operations re- 
search and shows how they can be derived from the abstract framework above. 
Many of the examples are from Rockafellar and Wets [38, 40 where they were 
formulated for bounded strategies. We will also point out some connections 
with more recent developments in finance and stochastics. A recent account 
of techniques and models of stochastic programming can be found in Shapiro, 
Dentcheva and Ruszczyski 46i. 

The best known duality frameworks involve functional constraints and La- 
grange multipliers. The most classical example is linear programming duality. 
These frameworks are deterministic special cases of the following stochastic pro- 
gramming framework from |40j . where sufficient conditions were given for the 
attainment of the dual optimum. 

Example 1 (Inequality constraints). Let 

) fo(x,w) if fj(x,w) + Uj <0 for j = l,...,m, 
I +oo otherwise, 
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where fj are convex normal integrands. To verify that f is a normal integrand, 
we write it as f = fo + Ej=i , where 



if (x,u) G Cj(u)), 
+00 otherwise 



6 Cj (x,u,uj) 



and Cj{uj) = {(x,u) \ fj{x,u) + uj < 0}. By JJ^ Proposition 14-33], the sets 
Cj are measurable so the functions 8c j are normal integrands by \42\ Exam- 
ple 14-32] and then f is a normal integrand by UM Proposition 14-44( c )]- The 
integral functional If is thus well-defined and equals 



I f (x,u) 



Ef (x(w),w) if fj(x(w),w) +u(u) < P-a.s. j 
+00 otherwise. 



The primal problem Q can be written as 

minimize E fo(x(uj) , lj) over x G TV 

subject to fj(x(uj),uj) < P-a.s., j = 1, . . . ,m. 

This is the classical formulation of a nonlinear stochastic optimization problem. 
It is a stochastic extension of classical mathematical programming models such 
as linear programming. 

The Lagrangian integrand becomes 

K x >y> UJ )= inf {f(x,u,u) - u ■ y} 



-00 



if fj(x, uj) = 00 for some j, 
= { fo{x,uj) + y ■ F(x,w) if f 3 (x,uj) < 00 and y > 0, 

otherwise, 

where F(x,u) = (fi(x, ui), . . . , f m (x, u>)). The expression g(y) = inf xe AT h (x, y) 
holds under the general condition of Theorem^ but to get more explicit expres- 
sions for the dual objective g one needs more structure on f ; see the examples 
below. 

To illustrate how the choice of the strategy space may affect the lower semi- 
continuity of if, consider the case n = m = 1. fo =0 and 

fi{x, u, uS) = a(uj)x + u, 

for some strictly positive a such that l/a ^ L . We get tp(u) — for every 
u G LP but there is no x G TV 1 which satisfies the pointwise constraint when 
ess inf u > 0. However, 



l(x,y,u) 



ya{uj)x ify>0, 
—00 otherwise, 



so, by the second part of Theorem [3J the strategies can be taken even bounded 
when calculating g. 
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It was observed in [301 Section 3 A] that the dual objective in Example Q] can 
be written in a more concrete form when the functions fj have a time-separable 
form. 

Example 2. Consider Example^ in the case 

T 
t=0 

where each fjj is an Tt -measurable normal integrand. Defining F t {xt,uj) — 
(fi,t{%t, w), . . . , f m ,t(xt, w)) and using the convention oo — oo = +oo, we can 
write 

T 

t=o 

where 



l t (x t ,y,oj) = < 



+oo if fj t (x t ,uj) = oo for some j, 

f .t(x t ,uj) + y ■ F t (x t ,u) if fj. t {x t ,u) < oo and y > 0, 
— oo otherwise. 



Assume now that F t (xt, •) € L p for every t and xt G K n * and that there is 
a v £ J\f p and a p-integrable random variable w such that fj <t (x,u>) > Vt(oj) ■ 
x - w{uj). It follows that F(x(-),-) <E L p for every x <E A/" 00 ; see e.g. £?7| 
Theorem 3K^. If there is an x £ Af°° such that oj h- fo,t(xt(u>), are integrable 
then, by the second part of Theorem [3J 



xGJV 00 — ' 



t=0 



Using the properties of conditional expectation (see e.g. Section II.7]), we 
get 



g(y)= inf Ey^E t l t (x t (uj),y(uj),uj) 

t=0 



inf BVyit^.^^.u) 
xeAf°° * — ' 

t=0 



Applying Theorem^ for t = 0, . . . ,T , we can express the dual objective as 

T 

g{y)^EY J 9t{{E t y){uo),uj), 



t=o 



2 If ||a;||£oo < r, there is a finite set of points x 1 G R J i = 1, . . . ,n whose convex combination 
contains the ball rB. By convexity, J^,t(*(w),w) < sup 4=1 /j t(a5*,w), where the right 
hand side is p-integrable by assumption. Combined with the lower bound, we then have 
Ft(x(-), ■) E L p as claimed. 
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where 

9t(y,u)= inf l t (x t ,y,u). 
The dual problem can thus be written as 

T 

maximize y^g t (y t (ui), u), 

y€M p * — ' 
t=0 

where Ai q is the set of M. m -valued q-integrable martingales. 

In the linear case, considered already in Danzig [10] . the dual problem in 
Example [5] can be written as another linear optimization problem. 

Example 3 (Linear programming). Consider Example^ in the case where 
f 0t t(x t ,uj) 



a 0> t(w)-x t i/iGR"', 
+oo otherwise 



and fjj(xt,Lu) = ajj(uj)-x t -\-bjj(uj) for Tt -measurable p-integrable nt- dimensional 
vectors aj t and T t -measurable integrable scalars bj :t - The primal problem can 
then be written as 

T 

minimize E ^""^ ao,t(u;) • x t (oj) over x G A/+ 
t=o 

T 

subject to ~^^[A t (uj)xt(uj) + b t (uj)} < P-a.s., 
t=o 

where A t {uj) is the matrix with rows ^^(w) and bt(uj) ~ (bj_ t (Lo))™=i- We get 

-oo if x t ~£_ 0, 

l t (x t ,y,uj) = { a Q>t (u) -x t +y [A t (u)x t +b t (u)] ifx t >0,y> 0, 
-oo otherwise 

-oo if x t t 0, 

= { i A t M?y + a o,tM] • x t + y ■ b t (uj) ifx t >0,y> 0, 
-oo otherwise, 



where At(tu) is the transpose of A t {uj). It follows that 



y-b t (u>) ify>0 and A*(uj)y + a , t (w) > 0, 
— oo otherwise 



and the dual problem can be written as 

T 

minimize E b t (uj) ■ yt(w) over y e M q + 
t=o 

subject to A* t {uj)yt{uj) + a ,t(u) > P-a.s. t = 0,...,T, 
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where Ai q + is the set of nonnegative q-integrable martingales. When T = 0, we 
recover the classical linear programming duality framework. 

The famous problem of optimal stopping is a one-dimensional special case 
of Example [3J 

Example 4 (Optimal stopping). The optimal stopping problem with an inte- 
grable nonnegative scalar process Z can be formulated as 

T T 

maximize E~^^xtZ t subject to Xt < 1, Xt G {0,1} P-a.s. 

x + t=o t=o 

The feasible strategies x are related to stopping times through t{uj) = inf {t \ Xt(io) = 
1}. The optimal value is not affected if we relax the constraint xt £ {0,1} (see 
below). The relaxed problem fits the framework of Example^ with n t = m = 1, 
p = oo, ao.t(w) = — Z t {uj), ai. t (w) = 1 and &i it (w) = — 1/(T+ 1). The dual 
problem becomes 

minimize Eyo subject to y > Z P-a.s.. 

yEM 1 

To justify the convex relaxation, we first note that the feasible set of the 
relaxed problem is contained in the space Af°° of bounded strategies. Since Z £ 
A/" 1 by assumption, it suffices (by the Krein-Millman theorem) to show that the 
feasible set of the relaxed problem equals the a(J\f°° ,Af 1 )-closed convex hull of 
the feasible set of the original problem. Let x be feasible in the relaxed problem. 
For v = 1, 2, . . define the stopping times 

t v ' 1 {lo) = inf{s | X s (lj) > i/v) i = l...,v, 
where X s (tu) = X)t=o x *( aj )- -^ ie strategies 



1 ifT»> i {u)=t, 

otherwise 



are feasible in the original problem. It suffices to show that the convex combi- 
nations 



' v 

i=l 

converge to x in the weak topology. By construction, 

s 

X s » := Y, <M = SU P{* I */" < e [X t (u>) - l/u,X t (w)], 

4=0 

so that X\ — > X t and thus x v t — > x% almost surely. Since x v and x are all 
contained in the unit ball of N°° , we have 

T T 

E^2 x t v t ->• Ej2 x tv t VweA/" 1 , 

(=0 t=0 

by the dominated convergence theorem. 
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Remark 4. The above duality frameworks suggest computational techniques 
for estimating the optimal value of the primal problem. The dual objective in 
Example^ is dominated for every y G M. q + by 



£> t (x tj w)<oj 

t=0 J 



(5) 



~g(y):=E inf \ V[/ t (a; t , w) + yt(u) ■ F t (x t , u)} 

L t=0 

If x' G W is feasible in the primal problem, we get for every y G Ai q + 

T 

~9{y) < Ej2[ft(x' t (uj),uj) + y t (Lj) ■ F t (4(u),u)] 

t=0 

t=0 t-0 ) 

T 

<Ej2ft{x' t (u),u). 



Minimizing over all feasible strategies x' G Af shows that ([5|) lies between g{y) 
and the optimum primal value <p(0). When ip is closed, we thus get that tp(0) — 
swp yeM i^g(y). The problem of finding the infimum in ([5]) can be seen as a 
deterministic version of the primal problem augmented by a penalty term in the 
objective. 



In the case of Example^ §5§ can be written for every y G M. q + 



an 



g(y) = E inf \ ^[-Z t x t + y t (x t - 1/(T + 1))] 



t=0 
T 



E '^L \ - Z t )x t - y ] 



t=0 ) 



This is the dual representation for optimal stopping obtained by Davis and 
Karatzas US}/ . This was used by Rogers \4^j ( see a ^ so Haugh and Kogan U9\j ) 
in a simulation based technique for computing upper bounds for the value of 
American options in complete market models. The technique is readily extended 
to the more general problem class of Example [H The technique can be further 
extended using the following. 

The cost of the nonanticipativity constraint on the strategies has been stud- 
ied in a number of papers; see e.g. Rockafellar and Wets 38 for a general discrete 
finite time framework as well as Wets [IS], Back and Pliska [2 , Davis [TT] and 
Davis and Burnstein [12] on continuous-time models. The cost can be described 
in terms of dual variables representing the value of information. The following 
derives a dual representation in the framework of Section [2] 
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Example 5 (Shadow price of information). Let h be a convex normal integrand 
and consider the problem 

minimize Ih{x). (6) 

xejV 

This can be seen as the primal problem associated with the normal integrand 

f(x, u, uj) = h(x + u,u). 

The value function <p(u) corresponds to adding a general Tt -measurable vector 
u t to each Xt in ©. We get 

l(x,y,uj) — inf {h(x + u, uj) — u ■ y} 

T 

= inf {h(w, uj) - y^(w t - x t ) ■ y t } 

w£R m * — ' 

t=Q 

T T 

= ^2 x t ■ Vt - sup {^2 wt-yt- h(w, uj)} 
t=o weRm t=0 

T 

= ^2 x fVt- h*(y,uj). 



t=o 



As long as there is a y £ L q such that Ih* (y) < oo, this satisfies the conditions 
of Theorem^ with r = p so that 

T 

g(y)= inf. E(S^x t (uj) ■ y t {uj) - h*{y{uj),uj)} 

t=0 

= {-Eh*(y(uj),uj) ify±j\fP, 
1 — oo otherwise. 

By Theorem^ 

(c\tp)(0)= sup -Eh*(y(uj),uj) 

y-LJ\fp 

T 

= sup E inf {h(x,ui) -S^ x t ■ yt(uj)}. 

The infimum in the last expression differs from the original problem in that the 
information constraints have been replaced by a linear term. This can be used 
to compute lower bounds for the optimal value using simulation much like in 
Rogers fJW and Haugh and Kogan \19]j in the case of optimal stopping problems; 
see Remark [JJ Rockafellar and Wets J38f gave sufficient conditions for the 
existence of a y G Ai 1 such that tp(0) = g(y) in the case of bounded strategies; 
see also Back and Pliska J^jj for a continuous-time framework with a special class 
of objective functions. 
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The following problem format is adapted from Rockafellar and Wets [H] . It 
has its roots in calculus of variations and optimal control; see Rockafellar [35] . 

Example 6 (Problems of Bolza type). Let n t = d and consider the problem 

T 

minimize E > L t (x t (uj), Ax t {uj),uj), (7) 

t=0 

where Ax t := Xt — x t ~\, X-\ := and each L t is an J- t -measurable normal 
integrand on M. d x R d x £1. This fits our general framework with 

T 

f(x,u,u) = y]L t (x t ,Axt + u t ,u), 

t=0 

where X-i :— and u — (uq, . . . ,Ux) with Ut G K d . Indeed, (J7J) (|T[) with 
u = 0. We get 

T 

l(x,y,u) = inf } [L t (x t , Ax t + u t , u) - ■ yt] 

t=0 
T 

= inf y~"[Lt(x t ,Vt,oj)-(v t -Axt)-yt] 

t=Q 

T 

t=0 
T 

= ■ A y*+i - H t ( x t , yt , w)] , 

t=0 

where yr+i '■= and Ht is the Hamiltonian defined by 

H t (x t ,yt,w) = sup {v t ■ y t - L t (x t ,v t ,uj)}. 

Thus 

T 

g(y) = inf EY] [Ax t ■ y t - H t (x t ,y t )] ■ 

x£N z — ' 

t=0 

By Jensen's inequality, g(y) < g(Try) where tt denotes the projection (yt)t=o ^ 
(Etyt)J— . Consequently, when maximizing g, we do not loose anything if we 
restrict y to the space M q of adapted q-integrable processes. Moreover, ifuE M p 
we have (u,y) — (u,iry), so that 

(c\ip)(u)= sup{(u,y) +g(y)} = sup {(u, y) + g(y)}. 

Assume now that there is an x £ J\f p such that EH t (xt,y) < oo and that 
(xt,u>) i — y —H t (xt,y(ui),ui) are Tt-measurable normal integrands for every y £ 
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Af q . We then get from Theorem^ the law of iterated expectations (see e.g. \47\ 
Section II. 7]) and Theorem^]] that for every y £ J\f q 

T 

g(y) = inf. fiV [-x t ■ Ay t+1 - H t (x t ,y t )} 

x£N p L — ■* 
t=0 

T 

= in /, E J2 \- x * ■ E ti A Vt+i] ~ H t (x uVt )] 

x£N p * — ' 
t=0 

T 

= in „ f [-XfE t [Ay t+1 ]-H t (x t ,y t )} 

t=0 
T 

= -E/, sup sup [x t ■ E t [Ay t+1 ] + v t ■ y t ~ L t (x t ,v t )} 

T 

= -Ej2L* t (E t [Ay t+1 ],y t ). 
t=o 

The dual problem thus looks much like the primal except that the (forward) 
difference term enters the integral functional through the conditional expectation. 

The above formulation of the Bolza problem was inspired by its continuous- 
time analogs. In the present discrete-time setting, the primal objective can be 
written as 

T 

minimize Ey Lt(xt(io),Xt-i(io),Lu) 

t=0 

for the normal integrands L t (xt, Xt-i,uj) := L(xt,xt — Xt-i,uj). This format 
covers the stochastic extensions of the von Neumann-Gale model studied e.g. in 
Dempster, Evstigneev and Taksar [T5] . 

4 Examples from mathematical finance 

Convex duality has long been an integral part of mathematical finance. The 
case of American options was already discussed in Remark 0] above. Perhaps 
the most famous instance is the "fundamental theorem of asset pricing" which, 
in perfectly liquid market models, relates the existence of an arbitrage oppor- 
tunity with that of an equivalent martingale measure for the underlying price 
process; see Delbaen and Schachermayer [14] for a comprehensive treatment 
of the perfectly liquid case and Kabanov and Safarian [35] for extensions to 
markets with proportional transaction costs. Other instances of convex duality 
can be found in problems of portfolio optimization or optimal consumption; see 
e.g. Cvitanik and Karatzas [5], Kramkov and Schachermayer [5S] or Karatzas 
and Zitkovic |25j . Biagini [6] reviews utility maximization in perfectly liquid 
market models. Klein and Rogers [26] propose an abstract duality framework 
that unifies several earlier ones on optimal investment and consumption under 
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market frictions. Several instances of convex duality in the financial context 
can be found in Follmer and Schied [17 who give a comprehensive treatment of 
the classical perfectly liquid market model in finite discrete time; see Examplc[7] 
below. 

We will show that, in finite discrete time, many duality frameworks in math- 
ematical finance are instances of the abstract duality framework of Section [5J 
Moreover, our framework allows for various generalizations of existing financial 
models. We will study financial problems by following Kabanov [23J in that 
none of the assets is given the special role of a numeraire. Instead, all traded 
securities are treated symmetrically and contingent claims, consumption etc. 
take their values in the space of portfolios. This setting covers more traditional 
models where trading costs and claims are measured in cash; see Example [7] 
below. 

Consider a market where d securities are traded over finite discrete time 
t = 0, . . . , T. At each time t and state lu G 57, the market is described by two 
closed convex sets, Ct(w) c M. J and D t {uj) C 1R J both of which contain the 
origin. The set Ct(w) consists of the portfolios that are freely available in the 
market at time t and D t (ui) consists of the portfolios that the investor is allowed 
to hold over the period [t, t + 1). For each t, the sets C t and D t are assumed to 
be J^-measurable. If Ct(u>) are polyhedral cones and D t (u>) = M. d (no portfolio 
constraints), we recover the model of [25] . 

A contingent claim process (with physical delivery) is a financial contract 
specified by an adapted E J -valued process u — (u t )f =Q . At each t = 0, ...,T, 
the seller of the claim delivers a (possibly state dependent) portfolio Ut to the 
buyer. Traditionally, financial mathematics has studied contingent claims that 
have only one payout date. This corresponds to ut — for t < T. In real markets 
with portfolio constraints, it is important to distinguish between payments that 
occur at different points in time. We refer the reader to [30l [31] for further 
discussion of the topic in the case of claims with cash-delivery. 

A trading strategy x G Af superhedges a claim process u G J\f if 

Ax t +u t e C t , x t € D u t = 0,...,T, x T =0 (8) 

almost surely. Here and in what follows, we always set x_i = 0. Superhedging 
is the basis of many results in financial mathematics. Even if superhedging is 
not quite feasible in many practical situations, it turns out to be a useful notion 
in studying more realistic approaches based on risk preferences. 

Example 7 (Cash delivery). Most contingent claims in practice give payments 
in cash. If cash is represented by the asset indexed 0, a claim with cash delivery 
has Ut = (uj , 0, . . . , 0) where u° is a scalar process. In this case, it is convenient 
to specify the market model by 

C t {uj) = {(z°, z) G R d | z" + S t (z,u) < 0}, 

where the function St{z,uj) represents the cost in cash of buying a portfolio 
z G at time t in state uj. The set Ct is Tt-measurable as soon as St is 
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an Tt-measurable normal integrand on R d_1 x Q. Such a model with portfolio 
constraints has been studied in \3(K \31^ . The budget constraint Ax t + u% G Ct 
can now be written as 

Az f ° + St(Az t ) + u° t < 0. 

// there are no constraints on z° (the position on the cash account), we 
can substitute out the variables z® for t — 1, ...,T to write the superhedging 
condition as 

T T 

5>° + £S t (A* t )<*°; (9) 

see \3(A Example 3.1]. In this setting, there is no need to discriminate between 
payments at different points in time. If, moreover, the functions St{-,uj) are 
linear so that St(z,uj) — s t (uj) ■ z for an adapted price process s — (st), we can 
rearrange terms in ^ to write it in terms of a "stochastic integral" as 

T T 
t=0 t=Q 

This is the traditional formulation of the superhedging problem. It is based on 
the assumptions that, the contingent claim gives payments in terms of a perfectly 
liquid asset and that assets can be traded without a cost. In practice, however, 
neither of the assumptions holds. 

The following example gives a dual characterization of the set of claims that 
can be superhedged at zero cost. Such characterizations are used in various 
results in mathematical finance; see Examples [9"1 and 1121 below. 

Example 8 (Consistent price systems). The superhedging condition (|5]) can be 
studied in our general duality framework with n t = d, m = (T + l)d and 



f(x,u,u) 



if Ax t + u t e C t (uj), x t G AM, x T = 
+oo otherwise. 



Indeed, we then get ip = Sc, where 

C = {u E L p \ 3x e J\f : Ax t + u t £ C t , x t € A, x T = 0} 

is the set of (not necessarily adapted) claim processes that can be superhedged 
at zero cost. This fits the framework of Example with 



L t (x, u,u)) = 



if x € AM an ^ u C*Mj 
+oo otherwise, 



where ArM : ~ {0}- Since 

L*{v,y,uj) = <T Dt ( u) (v) + a Ct ( u )(y), 
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we get for every y G J\f q 



9{y) = -EY^[a Dt (E t Ay t+1 ) + a Ct (Vt)], 



t=o 

where <7d t = 0. In the unconstrained case where A = R d for t = 0, . . . , T - 1, 
we have ajj t — <5{o} so that g(y) = — oo unless y is a martingale and thus, we 
recover \32\ Lemma 1^.3]. When C and D are conical, we have g — —Sjj, where 

V = {yeAf« | E t Ay t+1 G D* t , y t G C t *} 

and C*(w) and D^{uj) are the polar cones of Ct{ui) and D t {uj) respectively. The 
elements ofT> are called consistent price systems for the market model (C,D). 
The notion of a consistent price system was introduced in Kabanov [23] for the 
case of a polyhedral conical C and D = M. d . 

Much of trading in financial markets consists of exchanging sequences of 
cash-flows. In a typical situation, one exchanges a claim process u G J\f p for a 
multiple of another claim process p G J\f p - the premium process. Traditionally, 
financial mathematics has been mainly concerned with the special case where 
Pt = for t > and ut = for t < T . The best known application of this 
special setting is the pricing of European options. Due to portfolio constraints, 
however, premiums as well as claims are often paid over multiple points in time. 
Examples include swap contracts as well as various insurance contracts where 
premium payments are made throughout the life of the contract. 

The superhedging cost of a claim process u G Af p in terms of a premium 
process p G M p is defined as 

f{u) = inf{a | u — ap G C}, 

where C is the set of claim processes that can be superhedged with zero cost; 
see Example [5J The special case where claims and premiums are paid in cash 
has been studied in [51] , The following addresses the general case of "physical 
delivery" . 

Example 9 (Pricing by superhedging) . The superhedging cost is the value func- 
tion in our general framework with 

J a if Az t + u t - ap t G C t (w), z t G AM, z T = 
I +oo otherwise, 

where xq — (zq, a) and xt — z t for t — 1, . . . , T. We have assumed for simplicity 
that Tq = {0, ft} so that zq is deterministic. Alternatively, we could introduce 
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a new decision stage at time t = — 1 with a;_i = a and T-\ = {0, f2}. We get 
l(x, y, uj) = inf {f(x, u,u>) -u-y} 

= inf {a-u-y\ Az t + u t - ap t G C t (u), z t G D t (u>), z T = 0} 



w t G C t (u}), z t G D t (u)), z T = 



inf < a + y^(Az t -w t - ap t ) ■ y t 
(. t=o 

\ a + ELo( Az * - apt) - yt- ELo a c t (u){yt) if z t G and z T = 0, 

1 +oo otherwise 

- Ef=of>* • y«) - E^To 1 z * • A y<+i - E^o^ctM^*) e AM, 

I +oo otherwise. 



This satisfies the assumptions of Theorem [3] with r — oo so 



g(y) = inf < E 



a 



z t G D t 



T T-l T 

(1 - ^P* ' ^) ~ z fAyt + i -^<rCt{u){Vt) 

t=0 t=0 t=0 

inf 26 AT'» {--B [ELV z « ' E t Ay t +i + EtLo cr c , t (yt)] | z t e a} if EJ2j =0 Ptyt = 1 



— oo 
— oo 



lu=o a D t (u>)(E t Ayt+i) + Tl=o a c t (u)(yt) PEtoft • y* = 1, 

ot/ierTOse. 



where the last equality comes from the interchange rule in Theorem [7J T7ws 
expression corresponds to \3(A Lemma 7.1] which addressed contingent claim 
processes with cash delivery. By Jensen's inequality, 

Eto-c t (u){yt) > o- Ct ( u )(Etyt), 

so that g(y) < g(iry), where it denotes the "projection" (yt)f— i— > (Etyt)f = Q. 
This implies that, for adapted claims u G M p , 

{c\ip){u)= sup{(w,y) +g(y)} 
y eLi 

= sup {(u,y)+g(y)}, 

where Af q denotes the set of adapted p-integrable processes y = (yt)t=Q. This 
corresponds to [SI, Theorem 10] on claims with cash- delivery. Closedness con- 
ditions will be given in Corollary \10\ below. If C and D are conical, the above 
formula can be written as 



(c\ip)(u) = sup i 



u t -y t 



EY,Pfyt = i] 

t=0 ) 



where T> is the set of consistent price systems defined in Example^ 
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In the case of classical perfectly liquid markets, we can write the above 
results in a more familiar form. 

Example 10 (Martingale measures). Consider Example^in the classical per- 
fectly liquid case, where D = M. d and 

Ct(u) = {(x°, x 1 ) | x° + 8t{u) ■ x 1 < 0}. 

We get Ctito)* = {(y ,?/ 1 ) \ y > 0, y 1 = st(ui)y }, so the set of consistent price 
systems becomes 

V = {y G N q | E t Ay t+1 = 0, y° > 0, y\ = s t y° t }. 

Consider now Example 03 in the case where pt = for t = 1, . . . , T and po = 
(1, 0, . . . , 0). This corresponds to the classical pricing problem where the claim u 
is exchanged for a cash payment at time t — 0. The condition E J"^_ pt ■ yi = 1 
now means that yg = 1. If u is a claim process with cash delivery, i.e. u t — 
(it( , 0), we get 



T 



{c\y){u)= sup <^ Eyu t y° 



t=0 



E t Ay t+1 = 0, y° > 0, y] = s t y° t , y° Q = 1 



T 



sup E^^ul 



t=o 



where M. q (s) denotes the set of probability measures Q under which the price 
process s is a martingale and whose density dQ/dP is q-integrable. Indeed, it 
follows from the law of iterated expectations that the densities of such measures 
correspond to the random variables y\ above. For claims with — for t < T 
we obtain the classical dual representation of the superhedging cost. See for 
further discussion and references. 

We end this section with a model of optimal consumption problems in the 
general illiquid market model. 

Example 11 (Optimal consumption). Consider the problem 

T 

maximize Ey U t (c t ) 

t=0 

subject to Ax t + c t G C*, x t 6 D t t = 0, . . . ,T, 

where '■= {0} and U t is an ^-measurable concave normal integrand on 
M. d x il. This represents a problem of optimal consumption where possibly all 
traded assets can be directly consumed. To model situations where some of the 
assets cannot be consumed, one can set Ut(c,w) = — oo for c outside of the 
feasible consumption set. Defining C as in Example^ we can write the problem 
concisely as 

T 

maximize E 1 ^"^ U t (c t ) over c G C. 
t=o 
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This is the primal problem of Example^ in the case 
L t (x,v,Lo) 



m£ CteR d{-U t {c t ,uj)\v t + c t S C t (uj)} if x t <E D t (w), 
+00 otherwise. 



By \4°2\ Proposition H.J^l], L t is an J- \- measurable normal integrand as soon 
as it is lower semicontinuous in {x,v). Conditions for lower semiconinuity, in 
turn, can be obtained by pointwise application of \34\ Theorem 9.2]. It is easily 
checked that 

L*(v,y,u) = o- Dt{u) (v) + (T Ct ( u )(y) - U*{y,u), 

where 

U?(y,u})= inf ' {c-y -U t (c,w)} 
is the conjugate of Ut in the concave sense. If C and D are conical, we get 



Kv) 



'Ej:to u t(yt) ify^v, 

— 00 otherwise, 



where T> is the set of consistent price systems defined in Example [21 The dual 
problem can then be written in the symmetric form 



maximize E U^(yt) over y G T>. 



The dual pair of optimization problems above can be seen as a generalization 
(in discrete time) of the optimal consumption duality framework of Karatzas 
and Zitkovic ]25]j where the numeraire asset was consumed in a perfectly liquid 
market model in continuous time. 



5 Some closedness criteria 

Much of duality theory in convex analysis has been concerned with optimality 
conditions and the attainment of dual optimum. Dual attainment is equivalent 
to the subdifferentiability of the value function (p at the origin, which in turn 
is implied by continuity; see [361 Section 7]. In operations research, several 
"constraint qualifications" have bee proposed to guarantee the continuity of tp 
at the origin. Unfortunately, such conditions fail in many infinite dimensional 
applications. In order to get the mere absence of a duality gap, it is sufficient 
(as well as necessary) that tp be proper and lower semicontinuous at the origin. 
In this section, we outline techniques for establishing the lower semicontinuity 
of tp and the attainment of the primal optimum. 

The traditional technique for achieving lower semicontinuity of tp would be 
to introduce a topology on (an appropriate subspace of) J\f, to show that // 
is lower semicontinuous and to impose inf-compactness conditions on If with 
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respect to x; see e.g. [321 Theorem 1.17]. This is essentially the "direct method" 
in calculus of variations for verifying the existence of a solution to a minimiza- 
tion problem; see e.g. pQ. As long as the topology is strong enough to imply 
almost sure convergence of converging sequences, the lower semicontinuity of // 
often follows from Fatou's lemma and pointwise lower semicontinuity of normal 
integrands. The inf-compactness property, on the other hand, is often obtained 
with Alaoglu-type arguments provided the topology is weak enough. In particu- 
lar, ip is a(L°°, L 1 )-closed when the feasible set is L°°-bounded locally uniformly 
in u. This result applies already to many problems arising in practice and, in 
particular, to the optimal stopping problem in Example |U 

In some applications, the compactness condition does not hold. In the convex 
setting, the following version of a theorem of Komlos' [27] can often be used as 
a substitute. 

Lemma 4 (Komlos' theorem). Let (x u )^L 1 be a sequence in L°(f2, J 7 , P; R") 
which is almost surely bounded in the sense that 

sup|x I/ (w)| < oo P-a.s. 

Then there is a sequence of convex combinations x v G coja^ | [i > u} that 
converges almost surely to an M™ -valued function. 

Proof. See Delbaen and Schachermayer [14] or Kabanov and Safarian [22]. □ 

Different versions of the Komlos' theorem have long been used in calculus 
of variations; see e.g. Balder [3] or Schachermayer [33] for an application to the 
classical perfectly liquid market model in Example [TOl 

The almost sure boundedness in Lemma |31 can sometimes be obtained by 
pointwise application of classical finite-dimensional boundedness conditions on 
directions of recession. Given a convex set C, we will denote its recession cone 
by 

C*°° = {z | x + az G C, VieC, a > 0}. 

By [3U Theorem 8.4], a closed convex set C in a finite-dimensional space 
is bounded if and only if C°° = {0}. The proof of this result is based on 
the classical Bolzano- Weierstrass theorem on converging subsequences in finite- 
dimensional spaces. The following simple modification of |24[ Lemma 2] general- 
izes the finite-dimensional Bolzano- Weierstrass theorem to the present stochas- 
tic setting. 

Lemma 5. For an almost surely bounded sequence (x v ) < £L 1 in M there exists a 
strictly increasing sequence of Tt -measurable integer-valued functions (r u ) and 
an x £ J\f such that 

x T — > x 

almost surely. 
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Proof. Applying [2H Lemma 2] to {xq)'^L 1 we get an .Fo-measurable random 
subsequence Tq such that Xq° —> xo for an xo G L°(Q, !F , P; M. n °). Applying 
[24l Lemma 2] next to (x]° we get an J 7 ! -measurable subsequence t" of Tq 
such that x] 1 —> x\ for an x% G -L°(0, J 7 !, P; K ni ). Since Xq° — > xo we also have 
Xq 1 — > Xo- Extracting further subsequences similarly for t = 2, . . . , T we arrive 
at the conclusion. □ 

Sequences of the form (x T )^? =1 in the above lemma are called random sub- 
sequences of the original sequence (x u )^L 1 . 

If C : fl ^ M™ is a closed convex-valued ^-measurable mapping, then 
C°°(<jj) := C(uj)°° defines an ^-measurable mapping whose values are closed 
convex cones; see [42] Exercise 14.21]. The following result generalizes (34] The- 
orem 8.4] to stochastic models in finite discrete time. The proof follows the 
inductive argument in the proof of [32[ Theorem 3.3] with some simplifications. 
Theorem 3.3 of [32] deals with Example [8] in the case D t = R d and its proof 
builds on earlier techniques developed for conical models of financial markets; 
see e.g. [45] or [22] . 

Theorem 6. Let C : f2 =} K™ be closed convex-valued and J 7 -measurable. Every 
sequence in the set C — {x G M \ x G C a.s.} is almost surely bounded if and 
only if {x G N \ x G C°° a.s.} = {0}. 

Proof. If the recession condition fails, then C contains a half-line so it cannot 
be a.s. bounded. To prove the converse, we may assume that G C almost 
surely. Indeed, if C is empty, there is nothing to prove. Otherwise, we take any 
x G C, set C(tjj) := C(w) — x(oS) and note that the translation does not affect 
the recession cone of C or the almost sure boundedness of C. 

We use induction on T. Let T > and assume first that the claim holds for 
every (T — l)-period model. Let (x u )^ =1 C C and consider the following two 
complementary cases. 

Case 1: p(w) := sup |xg(o;)| < oo almost surely. Let 

M := {{ Xl )J =1 \x t e L°(n,T t ,P;« nt )}, 
Ci(w) := {(xi, ... ,£t) | 3x e jo(w)B : (x , . . .,x T ) G C(u>)}. 

By Proposition 14.11(a) and Proposition 14.13(a) of [42], C\ is ^-measurable 
and, by [33] Theorem 9.1], it is closed and convex- valued with 

C?{lj)={(x 1 ,...,x t )\(0,x 1 ,...,xt) GC°°(u;)}. 

Our assumption thus implies that {x G Afi \ x G P-a.s.} = {0} so the 
sequence {x\, . . . , Xj) is almost surely bounded by the induction hypothesis. 

Case 2: the set A = {uj G SI | sup |a;o(w)| = oo} has positive probability. Let 
a l/ = xa/ max{|a;o|, 1} and x u = a v x v . Passing to an J^-measurable random 
subsequence if necessary, we may assume that a"\0 almost surely. Since a v 
are J^-measurable, x v G M . We also have that 

x v ea»C 
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and \xq\ < 1 almost surely. Since a" < 1 and £ C we get a^C C C, by 
convexity. We are thus in the same situation as in case 1, so {x v )^ =1 is almost 
surely bounded. By Lemma [51 there is an J^-measurable subsequence t u such 
that (x T )%L-y converges almost surely to an x € AT. By [MJ Theorem 8.2], 

ieC°° 

almost surely, so x = by the assumption. This contradicts the positivity of 
P{A) since on A, we have 1 so that |a;o(w)| = 1- 

It remains to prove the claim for T = 0. This can be done as in Case 2 above 
except that now we do not need to refer to Case 1 for the boundedness of (x u )^L 1 
(This is essentially the finite-dimensional argument in [MJ Theorem 8.4]). □ 

With Theorem [6j we can generalize finite-dimensional closedness results to 
the present stochastic setting much like [MJ Theorem 8.4] was used in Section 9 
of [MJ. Orthogonal projections, which are central in the arguments of [MJ 
Section 9 ] , are not well-defined in the space AT but the following lemma (whose 
origins can be traced back to Schachermayer [44]) can be used instead. 

Lemma 7. Let L : Q W l be an J- -measurable mapping whose values are 
linear. For each t = 0, ...,T, there is an J- \ -measurable linear-valued mapping 
L t : ft ^ W lt such that 

{x t £L°(n,F t ,P;R nt )\3z £N : (0, . . . , 0, x t , z t+1 . . . , z T ) E L} 

= {x t e L a (n,T u P;R nt )\x t e L t P-a.s.}. 

Proof. It suffices to prove the claim for t — since otherwise we can replace L 
by the set-valued mapping L(u>) — {(xt, ■ ■ ■ , xt) \ (0, . . . , 0, Xt, ■ ■ ■ , xt) G L(co)} 
which is also closed- valued and J 7 - measurable, by Proposition 14.11(a) and 
Proposition 14.13(a) of [32] . For any ^-measurable closed-valued mapping S : 
=4 R fc , there is an J^-measurable mapping T t S whose T% -measurable selectors 
coincide with those of S. Indeed, it suffices to check that the proof of [20] Theo- 
rem 3.1] (and the proofs of the lemmas used in it) goes through in the case p = 
with the norm replaced by the metric d(x , x 2 ) = E min{|a; 1 (cLi) — x 2 (u>)\, 1}. We 
define L t recursively by Lt — Tj-L and 

Lt = r 4 P f L i+ i, 

where (P t L t+1 )(uj) := {(x , . ..,x t ) \ 3x t +i e M" t+1 : (x , ■ . . ,x t +i) G L t+1 (uj)}. 
The mapping P t L t+ i is J^+i-measurable (see [42j Proposition 14.13(a)]) and 
linear-valued. It is clear that if xo belongs to the set on the right side then 
a;o € Lq almost surely. The reverse direction follows from repeated application 
of the theorem on measurable selections; see e.g. [321 Corollary 14.5]. □ 

The following can be seen as a generalization of [MJ Theorem 9.1] to our 
stochastic setting. 
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Theorem 8. Let C : O =! R" x R m be a closed convex-valued T '-measurable 
mapping such that {x G AT \ (x, 0) G C°° a.s.} is a linear space. Then the set 

C = {u e L p \ 3x e J\f : (x,u)eC a.s.} 

is o(L p , L q )-closed. 

Proof. Let L = {x € R" | (x,Q) G C°°{uj) n (-C°°(cj))} and define L t as in 
Lemma [7J We have 

C = {u e L p 1 3x e N -. x t e Lj-, (x,u) g C a.s.}. 

Indeed, let :eq(w) be the pointwise orthogonal projection of xq(ui) to Lq(uj). By 
definition of L t , there is an extension x° G A/" of such that 0) G C° almost 
surely. Defining £° = x — x°, we have ig G Lq 1 and (x°,u) G C almost surely. 
Repeating the procedure for t = 1, . . . , T we arrive at an x T G A/" with xj G 
and (x T , u) G C almost surely. 

Assume first that p = 1. Since C is convex it suffices to prove t(L 1 ,L°°)- 
closedness. Since r{L , L°°) is the norm topology, it suffices to verify sequential 
closedness. So assume that (u")^ =1 C C converges to a u in norm and let 
x" G A/" be such that G and (s^u") G C. Passing to a subsequence, 
we may assume that u v — > u almost surely, so that the measurable function 
p(uj) := sup„ |u"(u;)| is almost surely finite. Each (x u ,u") thus belongs to the 
set 

C p = {(x, u) eAf x L°\ (x t ,u t ) G C p a.s.}, 

where C p (uj) = {(x,u) \x t G L^(u), u G p(w)B, (x,u) G C(cj)}, By [34, Corol- 
lary 8.3.3], 

C?(u>)={(x,0)\xt€Li-(u), (x,0)erM}, 
so, by the linearity assumption, 

{(x, u)eJVxL°\ (x, u) G a.s.} 

= {(x, 0) G M x L° | Xt G L^, (a, 0) G C°° n (-C°°) a.s.} 
= {.x G N | a; t G L^, x eL t a.s.} x {0}, 

which equals {0,0}, by the definition of L t . By Theorem [6j the sequence 
[x v , u^)^! is then almost surely bounded. By LemmaSJ there is a sequence of 
convex combinations (x v ', u")^! that converges almost surely to a point (x, it). 
We have u G C since C is convex and closed- valued and u — u since the original 
sequence was convergent to u. 

Now let p G [1, oo] be arbitrary. We have C = {u G LP \ Au G C 1 }, where C 1 
denotes the set C in the case p = 1 considered above and A : a(L p , L q )) — > 
(L 1 , ct(X 1 , is the natural injection. Since ^4 is continuous, the a(L p ,L q )- 

closedness of C follows from the cr{L l , L°°)-closedness of C 1 . □ 

The above result can be seen as a lower-semicontinuity result for the value 
function ip is situations where it takes the form of an indicator function. The- 
orem [6] and Lemma [7J allow the verification of the lower-semicontinuity of ip in 
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more general situations as well. This will be the subject of a separate article. 
We end this paper by showing how Theorem [8] yields some fundamental results 
in financial mathematics. An early application of recession analysis to portfolio 
optimization can be found in Bertsekas [5]. 

Example 12 (The no-arbitrage condition). Consider the market model studied 
in Section^ The model is said to satisfy the no arbitrage condition if 

CnA/"+ = {0}, (10) 

where C is the set of claim processes that can be superhedged without a cost 
(see Example^) and N+ is the set of nonnegative adapted processes. The no- 
arbitrage condition means that it is not possible to superhedge nontrivial nonneg- 
ative claims by costless transactions in the financial market. Condition (|10[) was 
studied in [32], where it was related to more traditional formulations of the no- 
arbitrage condition. In particular, in the classical perfectly liquid market model 
(see Examvle \10\) with strictly positive market prices, (|10[) is equivalent to the 
classical no-arbitrage condition expressed in terms of claims with cash-delivery 
and a single payout date; see e.g. In markets with portfolio constraints, 

however, the above formulation in terms of claims with multiple payout dates is 
more meaningful; see ]3(A [31^ . 

It was shown in Schachermayer \44\ Section 2} that, in case of the uncon- 
strained linear market model in Examvle \l(A the no-arbitrage condition implies 
thatC is closed. Theorem\B\yields a simple proof of this important result. Indeed, 
we now have 

C(u) = {(x,u) G W x R m | Ax t +u t e C t (u), x T = 0}, 

where Ct(bS) — {(a; ,^ 1 ) | x + st(co) ■ x 1 < 0}. Since C(uS) is conical, we have 
C°°(lu) — C(w) and the condition in Theorem^ can be written as 

{x G M | Ax t <E C t , x T — a.s.} = {x G M \ Ax t G C t °, x T = a.s.}, 

where (oS) :— Ct(ui) D (— Ct(w)). If this condition fails, there is an x G J\f 
such that Axt G Ct and xt — almost surely but for some t and a set A G 
Tt of positive probability, Axt (fc C t °. Since Ct(uj) is a half-space, we have 
Ct(uj) \ C t °(w) = int Ct(tj). Given a nonzero vector e G , the Tt-measurable 
nonnegative variable 

e(u>) :— max{a | Axt(oj) + ae G Ct(u>)} 

is thus strictly positive on A. We then have that x superhedges the nontrivial 
claim process defined by u t (uj) = e(uj)e and u s = for s ^ t. This violates the 
no-arbitrage condition (|10[) . so the closedness condition of Theorem\$\must hold 
under (|TU)) . 

The above argument extends directly to market models with transaction costs 
(without portfolio constraints) provided one slightly strengthens the no-arbitrage 
condition; see 14 5\ \22f for conical polyhedral market models or [321 for a more 
general convex model. 
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The closedness of C combined with the Kreps-Yan theorem (see e.g. [2Tj). 
yields the following famous result of Dalang, Morton and Willinger 9 . The 
proof of the Kreps-Yan theorem is based on separation and exhaustion argu- 
ments. 

Corollary 9 (Fundamental theorem of asset pricing) . The perfectly liquid mar- 
ket model of Example \10\ with a strictly positive market price process s satisfies 
the no arbitrage condition if and only if there is a probability measure equivalent 
to P under which s is a martingale. 

Proof. Assume that the no-arbitrage condition holds. By Example [HI C is 
closed in N p '. The Kreps-Yan theorem then gives the existence of a strictly 
positive y G Af q such that y G C*, where C* is the polar cone of C. It was shown 
in Example [5] that in the case of conical market models, C* equals the set V 
of consistent price systems, which in the case of perfectly liquid market models 
may be identified with martingale measures for s; see Example 1101 Similarly, 
strictly positive price systems y G T> correspond to martingale measures which 
are equivalent to P. 

On the other hand, the existence of a strictly positive martingale measure 
means that there is a strictly positive y G C*. Any nonzero u G C n A/+ would 
then satisfy both E(u ■ y) < and E(u ■ y) > 0, which is clearly impossible. □ 

The following gives closedness conditions for Example |H1 

Corollary 10 (Superhedging cost). Consider Example^ and assume that 

{x e J\f\ Ax G C t °°, x t G L» t °°, xt — a.s.} 

is a linear space and that the premium process p G M p is such that <p(0) > — oo. 
Then the superhedging cost 

<f{u) = inf{a | u — ap G C} 

is closed. 

Proof. The set C of claim processes that can be superhedged at zero cost corre- 
sponds to Theorem [5] with 

C{oj) = {(x,u) G R" x R m | Ax t + u t G C t {oj), x t G D t {oj), x T - 0}. 

By [Ml Corollary 8.3.3], 

C°°(oj) = {(x,u) G R" x R m | Ax t + u t G Cf (w), x t G Df (w), x T = 0}, 

so our assumption means that the closedness condition in Theorem[S]holds. The 
set C is thus closed and, in particular, algebraically (or radially) closed. The 
closedness of ip then follows exactly like in the proof of [311 Theorem 10]. □ 



As shown in Example the first condition coincides with the classical 
no-arbitrage condition in the case of the perfectly liquid market model of Ex- 
ample [TOJ The condition generalizes the closedness conditions given in [3"Tl 
Theorem 18] and in Theorem 13 of Kreher [25]. Relations of the first con- 
dition to certain generalized no-arbitrage conditions have been studied in [291 
Section 7]. The condition on the premium process p is mild. It means that 
the premium is a contingent claim that is not freely available in the market at 
unlimited amounts; see [3T] for further discussion. 

More results on the lower semicontinuity of the value function as well as 
extensions of the presented duality framework to a continuous-time setting will 
be presented in separate articles. 
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